CDS

Accession Number TCMCG075C29778
gbkey CDS
Protein Id XP_017984536.1
Location join(8005580..8005874,8005992..8006218,8006419..8006666,8006768..8007122,8007346..8007771)
Gene LOC18586663
GeneID 18586663
Organism Theobroma cacao
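
The Location field above lists five exon intervals, and their combined length can be checked against the stated protein length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention). The helper names `parse_join` and `spliced_length` are hypothetical and not part of this record.

```python
import re

def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs from a GenBank-style join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(segments: list[tuple[int, int]]) -> int:
    # Assumption: coordinates are 1-based and inclusive,
    # so each segment spans end - start + 1 bases.
    return sum(end - start + 1 for start, end in segments)

location = ("join(8005580..8005874,8005992..8006218,8006419..8006666,"
            "8006768..8007122,8007346..8007771)")
segments = parse_join(location)
total = spliced_length(segments)
codons = total // 3

print(len(segments), total, codons)  # 5 1551 517
```

The 517 codons correspond to the 516 amino acids of the protein plus one stop codon, which agrees with the Length field in the Protein section.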

Protein

Length 516 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018129047.1
Definition PREDICTED: cytochrome P450 714C2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category Q
Description cytochrome P450
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00199
KEGG_ko ko:K20661
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGCGCTGCTATTGCTACTTGTCAAGATCACTGGCACAAGTGTGTTGATGGCATTCATTGGCATGCTCATACACCTGTTTGATTCGATGATTTTGAATCCTGCAAGGCTTCGTGCCAAACTGCGAAAGCAAGGAATCCGGGGTCCCCCTCCAACACTGTTGCTGGGAAATACCCTTGACATAAAGAAGACACAATCTAAGTTGTCGATGTTGCCGCAAGAAGGAGAACAAGTGATAACCCACAATAGTTCTTCCACTGTGTTTCCTTACTTCGAACAATGGAGAGAACAGCATGGCCCAACGTTTTTGTTTTCACTAGGCAACATACAGATTTTACACGTAACTGATCCTGATTTGGTGAAGGAAATAATTACATGCACCTCAATGGATTTGGGAAATCCTACGTACCAACAAAAGGAGCGAGGTCCTCTGCTTGGCAAAGGCATTCTAACTTCAAATGGTGCATTATGGGCACATCAAAGGAAAATCATTGCTCCTGAATTATACATGGACAAGGTCAAGGGTATGACGACCTTAATGGCAGACTGTTCTGTTATGGTGGTGAACGAGTGGAAAAGCAAGATTGACGGTGAGGGTGGAATTGCAGACATAAAGGTTGACGATTATTTGAGAAGGTTTACCAGAGATGTTATCTCAAGGGCTTGTTTTGGAAGCAATTATTCCCAAGGGGAAGAGATCTTCTTCAAGATTAGAGCTCTGCAAGAAGCCATGTCTAAAAAAGTTTTATCTAATGGGTTCCCTGGAATGAGATATCTTCCGACAAAAAGCAACAGGGAAATATGGAGGTTGGAGAAAGAAGTTCGGGCGTTAATCTTGAAGGCTGTGTATAAAACCAAGGAAGAAAAATCAAAGGAGGACCTATTACAAATGATCCTAAAAGGCGCTAAGAACAGTGATTTAGGCCCTGATGCAACAGATAACTTCATTGTTGACAACTGCAAGAATATATATTTTGCTGGGTATGAAACTACTGCTATTACAGCAGCTTGGACCTTGTTGCTGCTAGCCTTGAACCCAGATTGGCAAGAGAAAGTTCGTGCAGAGGTTCTTGAAATTTGTGGGGGCAAATTACCAGATGCCGACATGATCCGTAAGATGAAAGCACTAACGATGGTGATCAGTGAGACACTACGGCTATACCCTCCAGGCGCTATTATATCGAGGGAGGCCCTGGAAGATATGAAATTTGGAGATATTCATGTGCCTAAAGGAGTTAATATATGGTTACTGCCAGCGACACTTCACCAAGATCCTGAAATATGGGGACCTGATGCTGACAAATTCAATCCTGAAAGGTTTTCCAATGGAGTCAGTGGAGCCTGCAAGTTTCCTCATGTTTATTTGCCTTTCGGATTCGGACCCCATACATGTTTGGGACAGCATTTCGCCTTGGCAGAACTTAAGCTACTTCTTGCTCTTGCCCTGTCAAACTTCACATTCTCTCCCTCACCAAAATATAGGCATTGTCCATCTCTGAGTTTGATTATAGAGCCTAAACATGGAGTGAATCTCATAGTTAGGAGGCTGTGA
Protein:  
MALLLLLVKITGTSVLMAFIGMLIHLFDSMILNPARLRAKLRKQGIRGPPPTLLLGNTLDIKKTQSKLSMLPQEGEQVITHNSSSTVFPYFEQWREQHGPTFLFSLGNIQILHVTDPDLVKEIITCTSMDLGNPTYQQKERGPLLGKGILTSNGALWAHQRKIIAPELYMDKVKGMTTLMADCSVMVVNEWKSKIDGEGGIADIKVDDYLRRFTRDVISRACFGSNYSQGEEIFFKIRALQEAMSKKVLSNGFPGMRYLPTKSNREIWRLEKEVRALILKAVYKTKEEKSKEDLLQMILKGAKNSDLGPDATDNFIVDNCKNIYFAGYETTAITAAWTLLLLALNPDWQEKVRAEVLEICGGKLPDADMIRKMKALTMVISETLRLYPPGAIISREALEDMKFGDIHVPKGVNIWLLPATLHQDPEIWGPDADKFNPERFSNGVSGACKFPHVYLPFGFGPHTCLGQHFALAELKLLLALALSNFTFSPSPKYRHCPSLSLIIEPKHGVNLIVRRL
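
The relationship between the CDS and Protein sequences above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 and last 12 bases of the CDS listed above.
print(translate("ATGGCGCTGCTATTGCTACTTGTCAAGATCACT"))  # MALLLLLVKIT
print(translate("AGGAGGCTGTGA"))                       # RRL
```

Both results match the start (MALLLLLVKIT) and end (RRL) of the listed protein sequence. Passing the full CDS string to the same function would allow the whole translation to be compared.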